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Abstract. We have designed, fabricated and tested an XY-addressable readout 
system that is specifically tailored for the reading of superconducting flux qubits in 
an integrated circuit that could enable adiabatic quantum optimization. In such a 
system, the flux qubits only need to be read at the end of an adiabatic evolution when 
quantum mechanical tunneling has been suppressed, thus simplifying many aspects of 
the readout process. The readout architecture for an iV-qubit adiabatic quantum 
optimization system comprises N hysteretic dc SQUIDs and N rf SQUID latches 
controlled by + 2 bias lines. The latching elements are coupled to the qubits 

and the dc SQUIDs are then coupled to the latching elements. This readout scheme 
provides two key advantages: First, the latching elements provide exceptional flux 
sensitivity that significantly exceeds what may be achieved by directly coupling the 
flux qubits to the dc SQUIDs using a practical mutual inductance. Second, the states 
of the latching elements are robust against the influence of ac currents generated by the 
switching of the hysteretic dc SQUIDs, thus allowing one to interrogate the latching 
elements repeatedly so as to mitigate the effects of stochastic switching of the dc 
SQUIDs. We demonstrate that it is possible to achieve single qubit errors rates of 
< 10~^ with this readout scheme. We have characterized the system-level performance 
of a 128-qubit readout system and have measured an error probability of 8 x 10^^ in 
the presence of optimal latching element bias conditions. 
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1. Introduction 

Many proposals exist for liow to build superconducting quantum computing systems 
[ll[2], each with their own unique challenges. One such proposal is to enable adiabatic 
quantum optimization (AQO) using networks of inductively coupled rf SQUID flux 
qubits [3]-E]- In AQO, one embeds an optimization problem of interest in the set of 
local flux biases applied to each qubit and in the set of inductive inter-qubit couplings 
available in a given processor architecture. The processor is initialized in a state wherein 
all single qubit tunneling energies are much larger than the energy scales involved in 
the problem of interest. In this case, all qubits will readily relax into their ground states. 
Thereafter, the AQO algorithm for finding an optimal solution proceeds by smoothly 
decreasing in time until it is much less than the energy scales involved in the problem 
of interest. If the resultant evolution of the state of the processor is adiabatic, then the 
final state of the processor will encode an optimal solution to the problem of interest. 
The challenge that is discussed in this article is that of reading the final state of an AQO 
processor. Note that at the end of the AQO algorithm A^ is negligible, thus naturally 
terminating the evolution in a state that is diagonal in the qubits' flux bases. This is 
in contrast to a general purpose gate model (GM) quantum information processor in 
which the final state of the processor could be a superposition state due to appreciable 

Several methods for reading the state of a flux qubit in its flux basis have been 
reported upon in the literature to date. One approach is to inductively couple a qubit 
to a hysteretic dc SQUID and then interrogate the latter device using either a current 
bias ramp [6l[7] or carefully crafted current pulse [8] to discern the bias at which it 
switches into the voltage state, which depends upon the flux imparted to the dc SQUID 
by the qubit. One key advantage to this readout mechanism is physical size: dc SQUIDs 
can be made relatively small, thus improving the prospects for scaling such a readout 
architecture to many qubit processors. 

Furthermore, as shown in this article, one can design scalable readout biasing 
architectures based upon dc SQUIDs that make economical use of a limited number 
of external bias controls. On the other hand, while hysteretic dc SQUIDs were used 
in early GM quantum computation experiments, they have fallen out of favor due to 
on-chip heating and back-action on the qubits. The first of these drawbacks could be 
remedied, in principle, by better thermal design. The second drawback is less of an issue 
for AQO as the quantum computation itself naturally localizes the state of each qubit 
by making A, negligible by the end of the computation, thus inhibiting transitions 
within the qubit. As such, hysteretic dc SQUIDs could still be of use in designing 
scalable readout architectures for AQO processors. Note that readout architectures 
based upon scalable dispersive techniques |9l[T0], which may prove to be essential in 
the development of GM processors, take up significantly more physical space than those 
based upon hysteretic dc SQUIDs but with some work may also be an attractive option 
for an AQO. 
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While the arguments presented above suggest that the relatively simple hysteretic 
dc SQUID readout of flux qubits may be useful in the development of large scalable 
AQO processors, our research has revealed one more drawback of such an approach. 
It is well known that unshunted dc SQUIDs when switched into the voltage state act 
as an on-chip source of microwave radiation. While this feature has been exploited to 
resonantly excite flux qubits in novel quantum mechanics experiments [11], this behavior 
would be highly undesirable in a future functional quantum information processor. We 
have observed that this behavior is particularly problematic in dense superconducting 
circuits as the radiation generated by a dc SQUID can resonantly drive multiple flux 
qubits within its vicinity, thus altering their final states. Part of this issue may simply be 
the choice of rf SQUID parameters as the design pressures in an AQO processor tend to 
favor flux qubits with large geometric inductances so as to facilitate multiple inter-qubit 
inductive couplings. Consequently, in order to achieve appreciable A^, one must lower 
the designed critical current of the rf SQUID, thus yielding a flux qubit with a relatively 
low tunnel barrier over which radiation from dc SQUIDs can drive resonant activation. 
This is clearly unacceptable as the solution to the optimization problem that has been 
posed to the processor is then corrupted by the act of reading. The novel approach that 
we have implemented to remedy this problem is to insert a quantum flux parametron 
(QFP) [T211I3] between each qubit and its dedicated dc SQUID. The QFP is an rf SQUID 
that possesses a small inductance, a large capacitance and a very large critical current. 
Consequently, the QFP possesses a substantially larger tunnel barrieiijl as compared to 
the flux qubit, which makes the state of the QFP robust in the presence of radiation 
emanating from dc SQUIDs in the voltage state. In our architecture, the QFP is used to 
quietly latch the final state of a given flux qubit with minimal back-action. Thereafter, 
the final state of the QFP is read using a hysteretic dc SQUID. Thus, the QFP and dc 
SQUID combination provides a viable route to a scalable readout architecture that is 
suitable for an AQO processor. 

2. Readout Circuit Design 

Figure [1] shows a high level schematic of a small portion of an XY-addressable readout 
system. Since this article is focused upon the readout architecture, we have suppressed 
the details of the flux qubits and their control circuitry and refer the reader to 
Refs. p3|[T5] for information concerning those elements. Each qubit in the circuit 
requires two elements for readout: a dc SQUID and a QFP. Each dc SQUID has two 
shared control lines: a current bias (cbo, cbi, ...) and a flux bias (fbo, fbi, ...). All QFP 
latches share a single activation line ("QFP latch", flux bias referred to as $fatch) ^ 

I For comparison, typical barrier heights in our rf SQUID flux qubits are 460 GHz. We have found 
that doubling the maximum /? of our qubits to 4 (and thus the barrier to closer to 1 THz) removes the 
resonant activation problem. The resulting devices are parametrically similar to typical phase qubit 
rf SQUIDs, which we have measured do not have this problem. Unfortunately, a maximum /3 near 4 
makes requirements on junction asymmetry too strict to be useful. 
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Figure 1. XY addressable readout system schematic. Eaclr dc SQUID Iras two sliared 
control lines: a current bias (cbo, cbi, ■•■) and a flux bias (fbo, fbi, ...). All QFP 
latches share a single activation line ( "QFP latch" ) and a single flux bias line ( "QFP 
flux bias"). The qubits are discussed in detail in Refs. [I41IT5]. 



single flux bias line ("QFP flux bias", flux bias referred to as '^'qfp)- Let the inductance 
and capacitance of the QFP be denoted by Lqfp and Cqfp, respectively. Each QFP is 
coupled to its affiliated qubit (dc SQUID) via a mutual inductance Mqfp qu (Mdc,qfp). 
The top panel of figure |2] shows the physical layout of a portion of the readout system, 
as implemented in the chips discussed in this article. Note that the QFP is galvanically 
attached to the qubit; this is done purely to reduce the contribution of the readout 
to the length and inductance of the qubit. The bottom panel of this figure shows a 
cross-section of our planarized fabrication process that was used to produce the test 
circuits reported upon herein. All results presented in this article were obtained from 
test circuits containing either 8 or 128 flux qubits, as indicated in the text. 

At a high level, the readout circuit functions as follows: During the course of an 
adiabatic quantum computation, all current biases cbx and flux biases fby are set to zero, 
thus ensuring that the dc SQUIDs are in the zero voltage state. In order to decouple 
the QFPs from the flux qubits, the QFP latch line is set to provide $fatch ~ ^of^ to all 
QFPs, thus suppressing their persistent currents to negligible levels. Once the qubits 
have reached their final states at the end of an adiabatic quantum computation, the 
waveform sequence depicted in figure [3] is applied to the readout circuitry. First, the 
QFP latch bias is raised to provide $fatch ~ "^o to all QFPs, thus raising the rf SQUID 
tunnel barriers (see figure H]), which adiabatically changes the state of each QFP from 
being the ground state of a monostable potential to being the groundstate of a bistable 
potential. The direction of the resultant groundstate persistent current flow about the 
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Figure 2. (top) Layout of a portion of an XY readout system. Note that all of the 
elements are shielded with a ground plane. The region shown above is 47x50 /xm^. 
There are four Nb metal layers: BASE (red), WIRA (blue), WIRB (green diagonal 
hashed), and WIRC (beige). The green dotted layer is a Ti/Pt resistor layer. Junctions 
are pink circles, and vias are denoted by crossed rectangles, (bottom) Cross-section of 
the planarized fabrication process. All metal layers have a nominal thickness of 300 
nm and the dielectric thickness is 200 nm. 

body of each QFP is determined by the polarity of the flux imparted by the qubit 
to which each QFP is coupled. At the end of this linear ramp the height of the QFP 
tunnel barriers AU^ip has been raised to a sufficiently high level so as to preclude further 
dynamics (Af/qfp ^ ^/-y/^qfpC^ ^ fcfeT, where T is the temperature of the chip), and 
the final states of the fiux qubits have been latched into the QFPs. Thereafter, a linear 
current ramp is applied to the bias cb^; whose maximum amplitude is significantly lower 
than the maximum dc SQUID switching current amongst the population of dc 
SQUIDs connected in series with cb^;, so that only the dc SQUID with a fiux bias 
applied to its fby line is triggered (see figure [5]). The timing of a voltage arising across 
the dc SQUID is then measured: If the fiux arising from the QFP increases (decreases) 
the total fiux applied to the dc SQUID addressed by cb^ and fbj^, then switching to the 
voltage state will happen sooner (later) (see figure [7]). This difference in timing reveals 
the state of the QFP and therefore the state of the qubit. A more detailed discussion of 
each of the elements in this readout architecture and their operation is presented below. 
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Figure 3. Waveforms for operating the 2x2 XY readout as in figure [TJ cbo and cbi 
are current bias lines carrying linear ramps with typical height of 1-2 /iA. fbo and fbi 
are flux bias lines carrying pulse currents of height 1-2 mA corresponding to pulsing 
from 0.0 to 0.35 $o applied to a dc SQUID. QFP latch line carries roughly 1-2 mA 
signal varying from 0.5 $o (quiescent) to 1 $o (latched) in the QFP. V(cba;) depicts 
schematically the voltage on the current bias lines when a single SQUID in the column 
is switched. The voltage signals shift slightly left and right in time depending on the 
state of the QFP. The voltage detection is done with a simple threshold comparator. 



2.1. Qubit signal 

For the purposes of reading the final state of the fiux qubits in an AQO processor, the 
relevant signal is the magnitude of the qubit persistent current when the qubit tunnel 
barriers have been raised to their maxima. The fiux qubits used in our circuits are 
of the type described in Ref. [13] and had inductances Lg ~ 300 pH, of which 10 pH 
was reserved for readout, and critical currents ~ 2.2 /iA. During readout, the qubit 
(3 = 2nLqI^/^Q ~ 2, corresponding to a circulating current of roughly ~ 2fiA. A 
typical mutual inductance from the qubit into its corresponding QFP is Mqfp^qu ~ 5.4 
pH, leading to a qubit signal as seen by the QFP of 2/^Mqfp qu ~ 10m$o. 

For the qubit parameters cited above, the maximum tunnel barrier height between 
fiux states would be AUq/h ~ 460 GHz. The design pressures that led to this choice 
are discussed in detail in [lU|15] and so we only briefiy repeat the relevant points 
herein. While in theory one could design fiux qubits with larger (and thus a larger 
AUq to provide immunity to any dc SQUID driving signal), asymmetry in the qubit 
junctions would eventually limit circuit performance as an AQO processor by producing 
uncorrectable fiux offsets in the qubit as is varied during annealing [H]. Furthermore, 
typical AQO architectures require a substantial number of inter-qubit couplings [3] which 
necessitates larger Lg than may be considered appropriate in the study of flux qubits 
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for implementing GM hardware. As such, the flux qubits in our circuits tend to possess 
substantially smaller in order to realize appreciable single qubit tunneling energies 
Aq over as broad of a range of annealing parameter as is practical. Consequently, our 
flux qubits appear to be more susceptible to resonant activation by dc SQUID switching 
transients than other flux qubits reported upon in the literature. 



2.2. QFP latch 

The QFP is a 2-junction rf SQUID, which, in comparison to our flux qubits |T] 
possesses a lower inductance Lqfp ^ 65 pH and a substantially higher critical current 
Jqfp ~ 12 /iA. Consequently, the former device has a maximum /3 = 2nLqfpI^^^/^Q = 
2.37, a maximum persistent current Jqfp ~ 10.5 /iA, and a maximum tunnel barrier 
between its two persistent current states on the order of Af/qfp ~ 3.4 THz (compare 
with 0.46 THz for the qubit). The effective 1-dimensional potential energy of a QFP 
with identical Josephson junctions can be written as 



^qfp= -^r^COS — ^ cos ^ , 

+ ^('^qfp-'^qfp)' (1) 

where (p = 27r$qfp/$o is the mean phase across the two Josephson junctions. Note that 
in tuning $fatch from $o/2 to $o that f/qfp(</)) morphs from being monostable to being 
bistable (see figure H]). Raising $fatch slowly adiabatically latches the state of each flux 
qubit into the QFP to which it is coupled for all qubit /QFP pairs in the integrated 
circuit. Adiabaticity is guaranteed here due to the large signal from the qubit and the 
relatively low bandwidth of the QFP latch line which has a rise time of approximately 1 
fis. Adiabaticity is violated during the latching phase only if there is no signal applied 
to the QFP, as the two QFP circulating current states are then degenerate in energy in 
the large (3 limit. Note that if the qubit state were to be unchanged after dc SQUID 
switching, then the QFP could be operated fully reversibly: the QFP could then be 
reset in the presence of the signal in which it was initially latched, so the whole cycle of 
barrier raising and lowering would then be adiabatic. 

One challenge in designing QFPs (or any 2-junction rf SQUID) is that junction 
asymmetry leads to a $fj^tj.h-dependent effective flux offset from zero [H]. This offset 
limits how small one can make Mqfp qu, as one would like the bimodal qubit signal to 
straddle any such flux offset for values of $fatch about which the QFP becomes bistable. 
Assuming a typical fabrication spread of 1% in junction critical currents and a QFP 
maximum /3 ~ 2.5, this offset could be up to 4m$o- For this reason, we had chosen 
Mqfp,qu such that 2JPMqfp,qu ~ 10m$o- 

There are two other features of the QFP latch that make it useful beyond the qubit 
state readout method described thusfar: 
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QFP Potential Energy 




qfp 

Figure 4. Depiction of tlie potential energy of the QFP as the CJ J bias is raised from 
$0/2 toward $0 during the readout process. Model parameters are Lqip = 65 pH and 
/qfp = 12 fiA. The two counter-circulating persistent current states correspond to the 
two minima of the bistable (red) potential energy landscape. The final QFP barrier 
height for ^fatch/'^o = 1 is near 3.5 THz (not shown on this figure). In the example 
shown, the qubit has been assumed to apply -|-3m$o, causing the QFP to latch into 
the right well. The corresponding /3 values of the curves, from monostable (black) to 
bistable (red) are: 0, 0.79, 0.99, 1.18. These are achieved by applying ^f^tch/'^'o = 0-5, 
0.602, 0.629, and 0.657 $0, respectively. 

2.2.1. Linear detection By biasing the QFP such that it is barely monostable, it can 
be used as a preamplifier acting on the signal from the qubit. Realistically, achieving 
appreciable net fiux gain into the dc SQUID requires careful tuning of the QFP and 
dc SQUID. Instead, to use the latching detector as a linear preamplifier, we apply 
fiux feedback in software to the QFP body loop to keep the QFP at its balance point 
where we have a 50% probability of reading a plus or minus signal from the QFP. The 
required fiux feedback is then a linear representation of the signal from the qubit. At 
this balanced population point, one can show that the QFP has minimal back-action on 
the qubit and can be used directly to probe the magnitude of the circulating current of 
the qubit and thus can be used to extract qubit parameters as is described in [H]. 

2.2.2. Shift register readout There is a well developed logic family based on QFPs [T3] . 
and it is straightforward to design a QFP based shift register to reduce the line 
requirements of this readout circuit. We have successfully fabricated and tested an 
8 QFP long shift register driven with a three phase clock and having a dc SQUID 
readout at the end as a prototype of such a readout scheme. 

2.3. dc SQUID 

The method for using a hysteretic dc SQUID for fiux detection is discussed in [16] and 
more generally in |6l|7]. For the implementation discussed herein, current ramp times 
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Figure 5. Schematic Ic vs. applied flux for a typical dc SQUID. Applying a signal via 
the fby line allows a user to shift the operating point on the abscissa for a given row 
of dc SQUIDs. Then when a current bias is applied to the cb^: line, only those squids 
whose critical current Ic has been suppressed will switch. 

were typically 10 — lOO/xs and peak voltages from the dc SQUIDs after switching were 
typically kept to much less than the gap voltage of Nb to reduce heating on chip. The 
mutual inductance from the dc SQUID to the QFP was Mdc,qfp ~ 1.4 pH. 

Basic operation of the XY- addressable array of dc SQUIDs is presented in figure |3l 
To read out the i^^ column and j^^ row QFP, we set the value of the current in the fb^ 
line in figured] to provide a flux bias of approximately 0.35 $o into the dc SQUIDs on 
that row and the value in all other fb^^j lines to be 0. That results in the suppression of 
the critical current for the dc SQUIDs in row j to Ic = Jco cos(7r0.35) = 0.45/co- Then, 
applying a current ramp on the column cbj of maximum amplitude O.SJco will cause 
the target dc SQUID to switch at a time that gives information about the flux coupled 
from the QFP. In other words, the fb^ lines are used to row select by flux biasing all dc 
SQUIDs in the target row to a bias point where a current ramp on the dc SQUID will 
trigger readout (see figure |5]). The cb^^ lines are used as column selects and all other 
current biases are kept at zero. After we have registered an event, the current biases 
are reset to zero, and the process repeated for all i and j. 

For devices with less than a few thousand qubits it is feasible to provide enough 
analog lines to run an XY readout scheme as described. Modifying the XY readout 
scheme by adding another control direction, for example an XYZ readout scheme where 
X and Z are summed together into the dc SQUID flux bias, changes the scaling of 
number of required wires to a cube root in the number of qubits, thereby allowing 
practical scaling to reading tens of thousands of qubits. If margins on the XYZ scheme 
are too low, then we can implement the QFP shift register method described earlier 
which uses a constant number of analog lines. In that case we can use excess analog 
lines to decrease readout time or increase operating margins. 
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2.4- Electronics 

The electronics to run the readout and qubit control were integrated into a compact 
rack system with 128 output channels, a fraction of which were devoted to readout. 
The dedicated readout channels had an amplifier, a tunable comparator chain and 
retriggerable timers in order to detect multiple SQUID switching events per channel. For 
the purposes of characterizing test circuits, we limited the readout sequence to a rate of 
approximately one dc SQUID switching event/100 /xs per channel. Without modifying 
the line bandwidth (3 MHz), this could in principle be sped up to roughly one event/5 /is. 
Further speed-up could be obtained by reading multiple columns simultaneously, but we 
have some experimental evidence that rf isolation between columns needs to be improved 
as dc SQUIDs from different columns can possibly interfere. With further work, one can 
likely reach the bandwidth limitation of the wiring in the refrigerator. For example, if a 
particular chip were to have 32 columns of dc SQUIDs and each line had a bandwidth 
of 3 MHz, one could obtain roughly a 50 Mbit/second readout rate. 

2.5. Power consumption 

The readout process dissipates energy onto the chip whenever a dc SQUID is in its 
normal state. An upper bound on the total energy dissipated by the readout process 
can be estimated as follows: After the dc SQUID switches to the voltage state, the large 
capacitance of the refrigerator wiring and filtering {C = 2nF) starts to charge with a rate 
set by ^ = Iswitch/C, typically ~ 500fj.V/fis. For simplicity, we ignore the first several 
nanoseconds after switching which are dominated by transients and ringing as the lines 
are not matched to the high impedance the dc SQUIDs present at switching. Until the 
dc SQUID reaches a significant fraction of the gap voltage there is httle dissipation on 
chip. For fast operation we typically restrict the charging time to less than two to three 
microseconds to keep the dc SQUID from charging to the gap voltage and generating 
heat. For the data presented in this paper, though, we allowed many microseconds to 
pass (see figure [7]). The resulting dissipated energy is At x 2.8mV x IftA ~ 28fJ. If we 
performed this same inefficient procedure with a 100/is readout cycle, we would obtain 
an average dissipated power during readout of ~ 280pW, which would be well within 
the capabilities of a standard dilution refrigerator to dissipate. 

3. Challenges 

While we routinely fabricate and operate XY readout systems based upon the 
architecture described herein, there were some formidable challenges that we had 
to overcome in order to operate them at their full potential. While the typical 
challenges associated with proper design of cryostat wiring, filtering, and shielding were 
encountered, we focus here on only those issues directly related to the readout scheme. 
First, failure to yield all junctions is typically a soft failure where one loses either a row 
or a column or even a single readout element. Since the readout system discussed herein 
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was typically a part of a large integrated circuit, it was possible to at least study large 
portions of a circuit even in the presence of a few fabrication errors. 

Related to fabrication yield is the issue of critical current variation across a chip. 
For the dc SQUIDs, small variations of critical current are not a significant issue as 
there is plenty of range between OFF and ON switching currents in this XY scheme. If 
one wanted to be even more frugal with bias line count, it would be possible by invoking 
an even higher dimensional scheme, such as XYZ in which one either sums X and Z into 
the dc SQUID flux bias or Y and Z into the current bias. However, such a scheme would 
come at the expense of a reduced ON and OFF switching current ratio in the presence 
of realistic critical current variations. 

Similar to critical current variation, flux offsets of each dc SQUID lead to reduced 
ON/OFF margins as well. In practice, this proved to be the key challenge to operating 
large scale XY readout schemes in our laboratory. We now use a combination of passive 
magnetic shielding and active magnetic field compensation to achieve fields below 1 nT 
perpendicular to the chip. We still typically end up with flux offsets of a few m$o in 
QFPs and dc SQUIDs, which may be due to local magnetic impurities on the on-chip 
wiring. Cross-talk, both between off-chip and on-chip wiring, similarly reduces operating 
margins. 

For the QFPs, asymmetry in the critical currents of their two Josephson junctions 
directly create an apparent flux offset that competes with the qubit signal, as discussed 
previously. Due to the threshold like latching, small differences in critical current do 
very little, while large differences render the device unusable as the qubit signal cannot 
overcome the asymmetry-induced flux bias acting on the QFP. Typical requirements are 
less than 2% junction critical current asymmetry in the QFP in order for the device to 
be operable. 

We have successfully dealt with all of the issues listed above, and do not expect 
further scaling problems up to at least one thousand qubits (32 dc SQUIDs in a column). 

A further subtle effect which is relevant to fast operation of the readout circuitry is 
thermally induced hysteresis. If one tries to perform experiments with a fast repetition 
rate (for tuned dc SQUID ramps, this rate is typically 100 /xs), then the chip temperature 
is affected by dc SQUID heating (see discussion in ^ below). This dc SQUID heating 
depends quite strongly on when the dc SQUID escapes to the voltage state, and therefore 
what signal is applied to the dc SQUID. We have found that to measure any temperature 
dependent phenomena, such as the transition width of a QFP or qubit where one 
simultaneously requires high speed and high accuracy measurements we needed to 
temperature stabilize the mixing chamber of the dilution refrigerator at 35-40 mK (much 
above the < 10 mK base temperature of the refrigerators) and take care to minimize 
the time the dc SQUID stays in the voltage state. 
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Figure 6. Characterization of QFP flux sensitivity. QFP state after latching as a 
function of flux added from the QFP flux bias line. The data (circles with error bars) 
fit to Eq. [5] (solid line), signifying that the detection is thermal noise dominated. The 
transition width is 142 fi^o^ which is to be compared with typical qubit signals of 
10m$o- 

4. Readout errors 

There are two errors that contribute to imperfect readout. The first is the thermal noise 
on the QFP, and the second is the uncertainty in the switching time of the dc SQUID 
caused by the stochastic nature of the quantum tunnehng from the zero voltage state 
to the voltage state pT]. Each of these error mechanisms will be discussed below. 

Examining the first error source, figure [6] shows the measured flux transfer curve 
of a typical QFP device, as obtained by applying a constant flux through the QFP flux 
bias line and then latching the QFP and measuring the probability of the QFP ending 
up in one of its two possible states (labelled 1 and -1) by repeating the measurement 
1024 times at each flux bias point. To understand the source of the QFP transition 
width, it suffices to analyze the dynamics of the two lowest QFP energy levels as the 
barrier Af/qfp between flux states is raised (as in figure H]) in the presence of a thermal 
bath. As the barrier is raised, the tunneling energy Aqfp between localized fiux states 
gets reduced to zero. 

Once AUqfp is high enough such that ksT and the square root of the integrated 
environmental noise spectral density W [18J is larger than the QFP ground to first 
excited state tunneling energy Aqfp, we reach the realm of incoherent macroscopic 
tunneling, as analyzed in |THlll9]. As long as A'^^p/W is fast enough, the system is 
able to stay in thermal equilibrium via incoherent tunneling. At some point A'^^p/W 
becomes small and the population statistics become effectively frozen, corresponding to 
some value of /3qfp ~ 1.1 — 1.2. For typical annealing times (time taken to raise the 
QFP barrier) of 1 /is, this value of Aqfp is approximately 1 MHz, and we can estimate 
the value of the persistent current of the QFP to be /qfp ~ 4.5 ± 0.5/iA at that point to 
convert the abscissa in figure [6] from fiux to energy (2 x<l>q£ x /qfp). By fitting the QFP 
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population as a function of $q£p to the expected functional form in the hmit Aqfp — )• 0; 



we extract an effective T = 47mK. For comparison, the refrigerator was stabilized at 
approximately 40 mK. If we decrease the cooling time after the dc SQUID switching 
event before the QFP latch in the next measurement frame we can characterize the 
cooling of the QFP after the dc SQUID heating event: 



When operated with dc SQUID current biases tuned to minimize heating, or with 
sufficient cooling time such as the effective 47 mK data shown in figure [6l we obtain a 
corresponding flux sensitivity of the QFP of kBT/I^^^ = 142 /i$o- For comparison, the 
qubit signal coupled into the QFP is of order 10 m$o- This results in practically zero 
error probability for the latch. 

The QFP flux uncertainty (142 yU$o) is significantly better than that of its 
accompanying dc SQUID, which was estimated to be 1.8 m$o from the width of its 
switching distribution. This dramatic difference in sensitivity arises from the fact that 
dc SQUID noise results from the stochastic nature of quantum tunneling from the 
zero voltage to finite voltage state with an equivalent temperature of several hundred 
mK, while the QFP annealing step leads to a thermal uncertainty near the refrigerator 
temperature of 40 mK. 

Focusing now on the second error source, figure [7] shows measured switching 
histograms of a single dc SQUID for the case when its respective qubit is ffist initialized 
in one state (red) and then in the other (blue). While the dc SQUID switching 
distributions are characteristic of quantum tunneling and thus asymmetric, we still 
can characterize the dc SQUID sensitivity by the standard deviation of the switching 
histogram. The abscissa can be converted from time to flux by direct measurement 
of the shift in switching time as a known amount of flux is applied to the dc SQUID. 
The resulting dc SQUID sensitivity is 1.8 m$o per read, and the signal from the QFP 
latch is 11.9 m$0; resulting in a measured error probability of the dc SQUID in reading 
the QFP as small as 0.01. This is a reasonably low error probability, but we require 
even lower error rates in order to implement a large scale AQO processor. It is trivial 
to decrease this dc SQUID limited error by taking more samples of the dc SQUID 
switching time since the QFP maintains its state between these samples. The results 
of this repeated reading of the dc SQUID on its resolving power are shown in figure 
[71 After 2 reads, the dc SQUID sensitivity increases to 1.3 m$o and after 4 reads to 
0.9 m$o- At 4 reads we obtain an error probability of 10~^. This error probability 




(2) 



Cooling time (/is) 



Temperature (mK) 



300 
1000 
2000 
4000 



75 ±7 
62 ±6 
55 ±5 
47 ±5 
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Figure 7. Reducing readout errors witli repeated dc SQUID sampling of the QFP 
state. The dc SQUID is interrogated with a 50 /is long linear current bias ramp. The 
red and blue curves correspond to different initialized flux states of the qubit (which 
is then adiabatically transferred to the QFP). The three lines (from thin to thick) 
correspond to 1, 2, and 4 averaged reads of the dc SQUID. Once 4 reads are performed 
we see no overlap in the data set, which was 4 million points. From the thick lines we 
extract an error probability of 10~^ that is quoted in the text. The small steps in the 
switching histograms are due to the resolution of the room temperature electronics. 
The measurements shown in this plot were obtained from one readout on an 8-qubit 
chip. 



includes qubit initialization errors, QFP errors, and dc SQUID errors. This behavior, 
while varying slightly quantitatively, was similar for all 8 readouts on this test chip. 

To demonstrate the performance of the entire readout system, we present the results 
of a system level margin measurement from a 128-qubit chip in figure [H This particular 
chip had two flux biases to provide <l>qfp to the 128 QFPs: The two lines each biased 
64 QFPs, with the two groups partitioned depending upon the physical orientation of 
the QFP body and denoted as "horizontal" or "vertical". We then studied the error 
probability of the readout system as a function of horizontal $qfp,h ^^nd vertical ^qfp^v 
QFP bias. In order to render the readout test independent of the details of quantum 
annealing, we prepared the 128 qubits into a known state by setting all of the inter-qubit 
couplings to zero [16], biasing all qubits using their local flux sources [15] hard to one 
side, and then raising their tunnel barriers. This placed each flux qubit into a known 
state with certainty. We then read the state of each qubit, as described above using 
a single dc SQUID switching measurement. This sequence was repeated 128 times for 
every point on a 2-dimensional grid in ($qfp h; ^qfp,v)"Space. The resultant contours of 
constant error are shown in figure [8] as black lines, as well as the designed margins as a 
blue dashed line. Inside the 1 error contour we observed no errors out of 128 reads of 
the entire 128 qubit system. These measurements alone confirm that the system level 
error rate can be less than 1/128 ~ 0.008 when biased appropriately. The offset of 
the measured margins in the horizontal direction was due to a single QFP with a flux 
offset, either from Josephson junction asymmetry or flux pinning, of — 10m$o- When 
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Figure 8. Measured QFP margins on a 128-qubit system. Three concentric black 
rings represent contours where the system returned 1/128, 4/128 and 9/128 failures, 
respectively, from inside to outside. The dashed blue box indicates the designed 
operating margins. 

this particular readout failed due to the inability of the qubit signal to straddle that 
QFP's degeneracy point, it contributed 1/128 to the total failure rate, thus dominating 
the shape of the inner most contour. The next margin contour then required a much 
larger horizontal flux bias shift to cause a second QFP to fail repeatedly. The further 
slow increase in failure rates is due to a several m$o random distribution of flux offsets 
in the QFPs. 

In order to better quantify the performance of the 128 qubit readout system, 
we biased the QFPs to an operating point near the center of the margin plot, 
("^qfp.h) "^qfp.v) = 0) m$o, and performed 65536 measurements in the manner cited 
above. We observed 5 errors out of 65536 measurements, which then provided an 
estimated system level error probability of 5/65536 ^ 8 x 10~^. The five errors were 
attributed to different readouts and were probably due to the stochastic nature of the 
dc SQUID switching. From these results, we can crudely estimate the per qubit error 
to be on the order of 5/(65536 x 128) 6 x 10"'' for the 128 qubit chip. This could be 
improved upon by simply using repeated dc SQUID reads, as was done with the 8 qubit 
chip, and the final limit upon the readout error probability will likely be determined by 
outlier events not captured in the preceding discussions. 

5. Conclusion 

We have described and implemented a scalable readout scheme that is suitable for 
a superconducting adiabatic quantum optimization system and measured the readout 
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error rate and margins of an 8- and 128-qubit system. We conclude that the single qubit 
readout error rate can be made less than 10~® using the QFP enabled architecture, which 
makes readout errors a non-issue for large scale integrated circuits of the types considered 
herein. We have experimentally demonstrated a system level error rate of 8 x 10^^ using 
a 128 qubit circuit. We do not expect any significant challenges in further scaling this 
architecture to about one thousand qubits. Finally, we have briefly discussed both an 
XYZ and a QFP shift register based solution that would facihtate scaling up to read 
even larger numbers of qubits on a single chip. 
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